Advanced HPC-CI Webinar Series: The TAU Performance System and E4S

Tuesday, March 11, 2025

11:00 AM - 12:30 PM PDT

This event will be held remotely.

The talk will highlight the advances in the TAU Performance System(R). The TAU Performance System [http://tau.uoregon.edu] is a versatile performance evaluation toolkit supporting both profiling and tracing modes of measurement. It supports performance evaluation of applications running on CPUs and GPUs and supports runtime-preloading of a Dynamic Shared Object (DSO) that allows users to measure the performance without modifying the source code or binary. It will describe how TAU may be used with MVAPICH MPI and support advanced performance introspection capabilities at the runtime layer. TAU's support for tracking the idle time spent in implicit barriers within collective operations will be demonstrated. TAU also supports event-based sampling at the function, file, and statement level. TAU's support for runtime systems such as CUDA (for NVIDIA GPUs), Level Zero (for Intel oneAPI DPC++/SYCL), ROCm (for AMD GPUs), OpenMP with support for OMPT and Target Offload directives, Kokkos, and MPI allow instrumentation at the runtime system layer while using sampling to evaluate statement-level performance data. It will describe the different instrumentation, measurement and analysis options that are available and how TAU is integrated in the Extreme-scale Scientific Software Stack (E4S). E4S is a curated, Spack based software distribution of 100+ HPC and AI/ML packages. A hands-on demo on AWS with the Extreme-scale Scientific Software Stack (E4S) [https://e4s.io] will be shown.

Instructor

Sameer Shende

Research Professor, University of Oregon

Sameer Shende serves as a Research Professor and the Director of the Performance Research Laboratory at the University of Oregon and the President and Director of ParaTools, Inc. (USA) and ParaTools, SAS (France). He serves as the Technical Lead of the Extreme-scale Scientific Software Stack (E4S), ParaTools Pro for E4S(TM), TAU Performance System, Program Database Toolkit (PDT), and HPCLinux projects. His research interests include scientific software stacks, performance instrumentation, compiler optimizations, measurement, and analysis tools for HPC. He received his B.Tech. in Electrical Engineering from IIT Bombay in 1991, and his M.S. and Ph.D. in Computer and Information Science from the University of Oregon in 1996 and 2001 respectively.